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PEPTIDE LIBRARY AND SCREENING SYSTEMS 



Field of the Invention 

The present invention relates generally to 
methods for selecting peptide ligands to receptor 
molecules of interest and, more particularly, to methods 
for generating and screening large peptide libraries for 
peptides with desired binding characteristics. 

Background of the Invention 

As molecular biology has helped to define 
regions of proteins that contribute to a particular 
biological activity, it has become desirable to 
synthesize short peptides to mimic (or inhibit) those 
activities. Many of the disadvantages encountered in 
therapeutic, diagnostic and industrial settings with 
purified proteins, or those produced by recombinant 
means, could easily be avoided by short synthetic 
peptides. For instance, synthetic peptides offer 
advantages of specificity, convenience of sample or bulk 
preparation, lover relative cost, high degree of purity, 
and long shelf -life. 

Despite the great promise of synthetic 
peptides, the technology remains, to a large extent, a 
laboratory tool. Precise sequence and binding data are 
not available for most proteins of significant medical, 
agricultural or industrial interest. Even when the 
sequence of a protein is known, the process of 



identifying short sequences which are responsible for or 
contribute to a biological activity may be extremely 
tedious, if not nearly impossible in many instances. 

Thus, the ability to generate and efficiently 
screen very large collections of peptides for desired 
binding activities vould be of enormous interest. It 
would enable the identification of novel agonists and 
antagonists for receptors, the isolation of specific 
inhibitors of enzymes, provide probes for structural and 
functional analyses of binding sites of many proteins, 
and ligands for many other compounds employed in a wide 
variety of applications. 

The generation of large numbers of peptide 
sequences by the cloning and expression of randomly- 
generated mixtures of oligonucleotides is possible in the 
appropriate recombinant vectors. See, e.g., Oliphant et 
al., Gene 44:177-183 (1986). Such a large number of 
compounds can be produced, however, that methods for 
efficient physical and genetic selection are required. 
Without such methods the usefulness of these large 
peptide libraries in providing ligands of potential 
interest may be lost. The present invention provides 
methods for efficient screening and selection from a 
large peptide library, fulfilling these and other related 
needs • 



Summary of the Invention 

The present invention provides novel methods 
and compositions for identifying peptides which bind to 
preselected receptor molecules. The peptides find a 
variety of therapeutic, diagnostic and related uses, 
e.g., to bind the receptor or an analogue thereof and 
inhibit or promote its activity. 

In one embodiment the invention relates to 
methods for identifying the peptides which bind to, a 
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presel cted receptor. In certain aspects the methods 
generally comprise c instructing a bacteriophage 
expression vect r which comprises an oligonucleotide 
library of at least about 10 6 members which encode the 
peptides. The library member is joined in reading frame 
to the 5 f region of a nucleotide sequence encoding an 
outer structural protein of the bacteriophage. 
Appropriate host cells are transformed with the 
expression vectors, generally by electroporation, and the 
transformed cells cultivated under conditions suitable 
for expression and assembly of bacteriophage. Using an 
affinity screening process, bacteriophage library members 
are contacted with the preselected receptor under 
conditions conducive to specific peptide -recept or 
binding, and bacteriophage whose coat proteins have 
peptides which bind the receptor molecule are selected. 
The nucleotide sequence which encodes the peptide on the 
selected phage may then be determined. By repeating the 
affinity selection process one or more times, the 
peptides of interest may be enriched. By increasing the 
stringency of the selection, e.g., by reducing the 
valency of the peptide-phage interaction towards 
substantial monovalency, peptides of increasingly higher 
affinity can be identified. 

In another aspect the methods are concerned 
with expression vectors having the oligonucleotide 
library members joined in reading frame with a nucleotide 
sequence to encode a fusion protein, wherein the library 
member represents the 5' member of the fusion and the 3' 
member comprises at least a portion of an outer 
structural protein of the bacteriophage. The first 
residue of the peptide encoded by the library member may 
be at the 5 '-terminus of the sequence encoding the phage 
coat protein. In preferred embodiments, where phage 
proteins are initially expressed as preproteins and then 
processed by the host cell to a mature protein, the 
library memb rs are insert d so as to 1 ave the peptide 
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encoded thereby at the N-terminus of the mature phag 
protein after processing or a pr tein substantially 
h mologous ther to* 

The invention also concerns host cells 
5 transformed with a bacteriophage expression vector having 

an oligonucleotide library member, joined in reading 
frame to the 5' region of a nucleotide sequence encoding 
an outer structural protein of the bacteriophage, wherein 
the library member encodes a peptide of at least about 

10 five to twenty- five amino acids. 

Generally, the oligonucleotide library of the 
invention comprises a variable codon region which encodes 
for the peptides of interest, and may optionally comprise 
sequences coding for one or more spacer amino acid 

15 residues, such as Gly. The variable region may be 

encoded by (NNK) x or (NNS) x , where N is A, C, G or T, K is 
G or T, S is G or C, and x is from 5 to at least about 8. 
In certain preferred embodiments the variable region of 
the oligonucleotide library member encodes a hexa- 

20 peptide. The variable codon region may also be prepared 

from a condensation of activated trinucleotides. 



25 Brief Description of the Drawings 

Fig. 1 depicts the construction of an 
oligonucleotide library. (A) The vector fAFFl contains 
two non-complementary BstXI sites separated by a 30 bp 

30 stuff er fragment. Removal of the BstXI fragment allows 

oriented ligation of oligonucleotides with the 
appropriate cohesive ends. (B) The oligonucleotide ON- 
49 was annealed to two "half-site" fragments to form 
cohesive termini complementary to BstXI sites l and 2 in 

35 the vector. The gapped structure, where the single- 

stranded region comprises the variable hexacodon sequence 
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and a 2 (gly) codon spacer* was ligated to the vector and 
electro-transformed into E. coli . 

Fig. 2 depicts the amino acid sequences 
(deduced from DNA sequence) of the N-terminal 
hexapeptides on pill of infectious phage randomly 
selected from the library. Sequences begin at the signal 
peptidase site. Single letter code for amino acids is A 
(Ala), C (Cys), D (Asp), E (Glu) , F (Phe) , G (Gly), H 
(His), I (lie), K (Lys), L (Leu), M (Met), N (Asn) , P 
(Pro), Q (Gin), R (Arg) , S (Ser) , T (Thr) , V (Val) , W 
(Trp), Y (Tyr). 

Fig. 3 illustrates the composite ONA sequence 
of the variable region of pools of (A) infectious phage 
from the library, and (B) phage recovered from 1, 2, or 3 
rounds of panning on mAB 3E7. Phage were amplified as 
tetracycline resistant colonies and DNA from a pool of 
phage derived from several thousand of these colonies was 
Isolated and sequenced. The area of the sequencing gel 
corresponding to the cloning site in genelll is 
displayed. A sequencing primer was annealed to the phage 
DNA -40 bases to the 3' side of the cloning site. The 
actual readout of the gel is the sequence complementary 
to the coding strand. For clarity of codon 
identification, the lanes may be read as C, T, A, G, left 
to right and 5* to 3', top to bottom, to identify the 
sequence of the coding (+) strand. 

Fig. 4 shows the amino acid sequences (deduced 
from DNA sequence} of the N-terminal peptides of pill of 
51 phage isolated by three rounds of panning on mAB 3E7. 

Fig. 5 illustrates the results of phage 
sandwich ELISAs for YGGFL- and YAGFAQ-phage with 
biotinylated monoclonal antibody 3E7 IgG (Fig. 5A) or 3E7 
Fab fragments (Fig. 5B) immobilized at maximal density on 
streptavidin coated wells and labeled polyclonal anti- 
phage antibodies to detect bound phage. 

Fig. 6 illustrates the results of phage 
sandwich ELISAs which compare the effect of 3E7 Fab 
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concentration at 5 nM (Fig. 6A) and 50 pM (Fig. 6B) and 
wash times (minutes) on recoveries of YGGFL- and YAGFAQ- 
phage. 

Fig. 7 shows 3E7 Fab dissociation from phage 
bearing peptides of known affinity, YGGFL and YGFWGM. 

pgscrjptjpn gf Preferrefl EnfeocUiBentp 

Methods and compositions are provided for 
identifying peptides which bind to receptor molecules of 
interest. The peptides are produced from oligonucleotide 
libraries which encode peptides attached to a 
bacteriophage structural protein. A method of affinity 
enrichment allows a very large library of peptides to be 
screened and the phage carrying the desired peptide (s) 
selected. The nucleic acid may then be isolated from the 
phage and the variable region of the oligonucleotide 
library member sequenced, such that the amino acid 
sequence of the desired peptide is deduced therefrom. 
Using these methods a peptide identified as having a 
binding affinity for the desired molecule may then be 
synthesized in bulk by conventional means. 

By identifying the peptide de novo one need not 
know the sequence or structure of the receptor molecule 
or the sequence of its natural binding partner. Indeed, 
for many "receptor" molecules a binding partner has not 
yet been identified. A significant advantage of the 
present invention is that no prior information regarding 
an expected ligand structure is required to isolate 
peptide ligands of interest. The peptide identified will 
thus have biological activity, which is meant to include 
at least specific binding affinity for a selected 
receptor molecule, and in some instances will further 
include the ability to block the binding of other 
compounds, to stimulate r inhibit metabolic pathways, to 
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act as a signal or messenger, to stimulate or inhibit 
cellular activity, and the like. 

The numb r of possible receptor mol cules for 
which peptide ligands may be identified by means of the 
present invention is virtually unlimited. For example, 
the receptor molecule may be an antibody (or a binding 
portion thereof) . The antigen to which the antibody 
binds may be known and perhaps even sequenced, in which 
case the invention may be used to map epitopes of the 
antigen. If the antigen is unknown, such as with certain 
autoimmune diseases, for example, sera or other fluids 
from patients with the disease can be used in the present 
methods to identify peptides, and consequently the 
antigen which elicits the autoimmune response. It is 
also possible using these methods to tailor a peptide to 
fit a particular individual's disease, once a peptide 
has been identified it may itself serve as, or provide 
the basis for, the development of a vaccine, a 
therapeutic agent, a diagnostic reagent, etc. 

The present invention can identify peptide 
ligands for a wide variety of substances in addition to 
antibodies. These include, by way of example and not 
limitation, growth factors, hormones, enzymes, 
interferons, inter leukins f intracellular and 
intercellular messengers, lectins, cellular adhesion 
molecules and the like, as well as the ligands for the 
corresponding receptors of the aforementioned molecules. 
It will be recognized that peptide ligands may also be 
identified by the present invention for molecules which 
are not peptides or proteins, e.g., carbohydrates, non- 
protein organic compounds, metals, etc. Thus, although 
antibodies are widely available and conveniently 
manipulated, they are merely representative of receptor 
molecules for which peptide ligands can be identified by 
means of the present invention. 

An oligonucleotide library, prepared according 
to the criteria as described herein, is inserted in an 
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appropriat vector ncoding a bacteriophage structural 
protein, preferably an accessibl phage prot in, such as 
a bacteriophage coat protein. Although one skilled in 
the art will appreciate that a variety of bacteriophage 
may be employed in the present invention, in preferred 
embodiments the vector is, or is derived from, a 
filamentous bacteriophage, such as, for example, fl, fd, 
Pfl, Ml 3, etc. In a more preferred embodiment the 
filamentous phage is fd, and contains a selectable marker 
such as tetracycline (e.g., "fd-tet") . The fd-tet vector 
has been extensively described in the literature. See, 
for example, Zacher et al., Gene 9:127-140 (1980), Smith 
et al., Science 228:1315-1317 (1985) and Parmley and 
Smith, Gene 73:305-318 (1988), each incorporated by 
reference herein. 

The phage vector is chosen to contain or is 
constructed to contain a cloning site located in the 5* 
region of the gene encoding the bacteriophage structural 
protein, so that the peptide is accessible to receptors 
in an affinity selection and enrichment procedure as 
described below. As the structural phage protein is 
preferably a coat protein, in phage fd the preferred coat 
protein is pill. Each filamentous fd phage is known to 
have up to four or five copies of the pill protein. 

An appropriate vector allows oriented cloning 
of the oligonucleotide sequences which encode the peptide 
so that the peptide is expressed at or within a distance 
of about 100 amino acid residues of the N-terminus of the 
mature coat protein. The coat protein is typically 
expressed as a preprotein, having a leader sequence. 
Thus, desirably the oligonucleotide library is inserted 
so that the N-tenninus of the processed bacteriophage 
outer protein is the first residue of the peptide, i.e., 
between the 3 '-terminus of the sequence encoding the 
leader protein and the 5 '-terminus of the sequence 
encoding the mature protein or a portion of the 5' 
terminus . 
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The library is construct d by cloning an 
oligonucle tide which contains the variable region of 
library m mbers (and any spac rs, framew rk determinants, 
etc. as discussed below) into the selected cloning site. 
Using known recombinant DNA techniques (see generally, 
Sambrooke et al., Molecular Cloning. A Laboratory Manual. 
2d ed . , Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1989, incorporated by reference herein), an 
oligonucleotide may be constructed which, inter alia , 
removes unwanted restriction sites and adds desired ones, 
reconstructs the correct portions of any sequences which 
have been removed (such as a correct signal peptidase 
site, for example) , inserts the spacer conserved or 
framework residues, if any, and corrects the translation 
frame (if necessary) to produce active, infective phage. 
The central portion of the oligonucleotide will generally 
contain one or more of the variable region domain (s) and 
the spacer or framework residues. The sequences are 
ultimately expressed as peptides (with or without spacer 
or framework residues) fused to or in the N-terminus of 
the mature coat protein on the outer, accessible surface 
of the assembled bacteriophage particles. 

The variable region domain of the 
oligonucleotide comprises the source of the library. The 
size of the library will vary according to the number of 
variable codons, and hence the size of the peptides, 
which are desired. Generally the library will be at 
least about 10 6 members, usually at least 10 7 , and 
typically 10* or more members. To generate the collection 
of oligonucleotides which forms a series of codons 
encoding a random collection of amino acids and which is 
ultimately cloned into the vector, a codon motif is used, 
such as (NNK) x , where N may be A, C, G, or T (nominally 
equimolar) , K is G or T (nominally equimolar) , and x is 
typically up to about 5, 6, 7, or 8 or more, thereby 
producing libraries of penta-, hexa-, hepta-, and octa- 
peptides or more. Th third position may also be G or C, 
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designated "S". Thus, NNK r NNS (i) code for all the 
amin acids, (ii) c de for only one stop codon, and (iii) 
reduce the range of codon bias from 6:1 to 3:1. It 
should be understood that with longer peptides the size 
of the library which is generated may become a constraint 
in the cloning process and thus the larger libraries can 
be sampled, as described here inbe low. The expression of 
peptides from randomly generated mixtures of 
oligonucleotides in appropriate recombinant vectors is 
discussed in Oliphant et al., Gene 44:177-183 (1986), 
incorporated herein by reference. 

An exemplified codon motif (NNK) produces 32 
codons, one for each of 12 amino acids, two for each of 
five amino acids, three for each of three amino acids and 
one (amber) stop codon. Although this motif produces a 
codon distribution as equitable as available with 
standard methods of oligonucleotide synthesis, it results 
in a bias against peptides containing one-codon residues. 
For example, a complete collection of hexacodons contains 
one sequence encoding each peptide made up of only one* 
codon amino acids, but contains 729 (3 6 ) sequences 
encoding each peptide with only three-codon amino acids. 

An alternative approach to minimize the bias against 
one-codon residues involves the synthesis of 20 activated 
tri -nucleotides , each representing the codon for one of 
the 20 genetically encoded amino acids. These are 
synthesized by conventional means, removed from the 
support but maintaining the base and 5-HO-protecting 
groups, and activating by the addition of 3»0- 
phosphoramidite (and phosphate protection with b- 
cyanoethyl groups) by the method used for the activation 
of mononucleosides, as generally described in HcBride and 
Caruthers, Tetrahedron Letters 22:245 (1983), which is 
incorporated by reference herein. Degenerate 
"oligocodons" are prepared using these trimers as 
building blocks. The trimers are mixed at the desired 
molar ratios and installed in th synthesizer. The 
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ratios will usually be approximately quimolar, but may 
b a control 1 d unequal ratio to obtain the over- to 
under-representation of c rtain amino acids coded for by 
the degenerate oligonucleotide collection. The 
condensation of the trimers to form the oligocodons is 
done essentially as described for conventional synthesis 
employing activated mononucleosides as building blocks. 
See generally . Atkinson and Smith, oligonucleotide 
Synthesis . N.J. Gait, ed. p35-82 (1984). Thus, this 
procedure generates a population of oligonucleotides for 
cloning that is capable of encoding an equal distribution 
(or a controlled unequal distribution) of the possible 
peptide sequences. This approach may be especially 
useful in generating longer peptide sequences, since the 
range of bias produced by the (NNK) 6 motif increases by 
three-fold with each additional amino acid residue. 

When the codon motif is (NKK) x , as defined 
above, and when x equals 8, there are 2.6 x 10 10 possible 
octa -peptides. A library containing most of the octa- 
peptides may be difficult to produce. Thus, a sampling 
of the octa-peptides may be accomplished by constructing 
a subset library using of about .1%, and up to as much as 
1%, 5% or 10% of the possible sequences, which subset of 
recombinant bacteriophage particles is then screened. As 
the library size increases, smaller percentages are 
acceptable. If desired, to extend the diversity of a 
subset library the recovered phage subset may be 
subjected to mutagenesis and then subjected to subsequent 
rounds of screening. This mutagenesis step may be 
accomplished in two general ways: the variable region of 
the recovered phage may be mutagenized, or additional 
variable amino acids may be added to the regions 
adjoining the initial variable sequences. 

A variety of techniques can be used in the 
present invention to diversify a peptide library or to 
div rsify around peptides found in early rounds of 
panning to have sufficient binding activity. In one 
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approach, th positive phage (those identified in an 
early round of panning) are sequenced to d termine the 
identity of the active p ptides. Oligonucleotides are 
then synthesized based on these peptide sequences, 
employing a low level of all bases incorporated at each 
step to produce slight variations of the primary 
oligonucleotide sequences. This mixture of (slightly) 
degenerate oligonucleotides is then cloned into the 
affinity phage as described herein. This method produces 
systematic, controlled variations of the starting peptide 
sequences. It requires, however, that individual 
positive phage be sequenced before mutagenesis, and thus 
is useful for expanding the diversity of small numbers of 
recovered phage. 

Another technique for diversifying around the 
recognition kernal of the selected phage-peptide involves 
the subtle mis incorporation of nucleotide changes in the 
peptide through the use of the polymerase chain reaction 
(PCR) under low fidelity conditions. A protocol of Leung 
at al., Technique 1:11-15 (1989) alters the ratios of 
nucleotides and the addition of manganese ions to produce 
a 2% mutation frequency. Yet another approach for 
diversifying the selected phage involves the mutagenesis 
of a pool, or subset, of recovered phage. Phage 
recovered from panning are pooled and single stranded DNA 
is isolated. The DNA is mutagenized by treatment with, 
e.g., nitrous acid, formic acid, or hydrazine. These 
treatments produce a variety of damage in the DNA. The 
damaged DNA is then copied with reverse transcriptase 
which misincorporates bases when it encounters a site of 
damage. The segment containing the sequence encoding the 
variable peptide is then isolated by cutting with 
restriction nuclease (s) specific for sites flanking the 
variable region. This mutagenized segment is then 
recloned into undamaged vector DNA in a manner similar to 
that described herein. The DNA is transformed into cells 
and a s condary library is constructed as described. The 
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g n ral mutagenesis method is described in detail in 
Myers, t al., Nucl. Acids Res . 13:3131-3145 (1985), 
Myers et al., Science 229:242-246 (1985), and Myers, 
?« TT ent Protocols In Molecular Biology Vol I, 8.3.1 - 
8.3.6, F. Ausebel, et al., eds, J. Wiley and Sons, New 
York. (1989) , each of which are incorporated herein by 
reference. 

In the second general approach, that of adding 
additional amino acids to a peptide or peptides found to 
be active, a variety of methods are available. In one, 
the sequences of peptides selected in early panning are 
determined individually and new oligonucleotides, 
incorporating the determined sequence and an adjoining 
degenerate sequence, are synthesized. These are then 
cloned to produce a secondary library. 

In another approach which adds a second 
variable region to a pool of peptide-bearing phage, a 
restriction site is installed next to the primary 
variable region. Preferably, the enzyme should cut 
outside of its recognition sequence, such as BspMI which 
cuts leaving a four base 5* overhang, four bases to the 
3* side of the recognition site. Thus, the recognition 
site may be placed four bases from the primary degenerate 
region. To insert a second variable region, the pool of 
phage DNA is digested and blunt-ended by filling in the 
overhang with Klenow fragment. Double-stranded, blunt- 
ended, degenerately synthesized oligonucleotide is then 
ligated into this site to produce a second variable 
region juxtaposed to the primary variable region. This 
secondary library is then amplified and screened as 
before. 

While in some instances it may be appropriate 
to synthesize peptides having contiguous variable regions 
to bind certain receptors, in other cases it may be 
desirable to provide peptides having two or more regions 
of diversity s parat d by spacer residues. For example, 
the variable regions may be separated by spacers which. 
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allow the diversity domains of the peptides to be 
presented to the receptor in different ways. The 
distance between variable regions may be as little as on 
residue, sometimes five to ten and up to about 100 
residues. For probing a large binding site the variable 
regions may be separated by a spacer of residues of 20 to 
30 amino acids. The number of spacer residues when 
present will preferably be at least two, typically at 
least three or more, and often will be less than ten, 
more often less than eight residues. 

Thus, an oligonucleotide library having 
variable domains separated by spacers can be represented 
by the formula: 

(NNK) y -(abc) n -(NNK) z , 
where N and K are as defined previously (note that S as 
defined previously may be substituted for K) , and y + z 
is equal to about 5, 6, 7, 8, or more, a, b and c 
represent the same or different nucleotides comprising a 
codon encoding spacer amino acids, n is up to about 20 to 
30 amino acids or more. 

The spacer residues may be somewhat flexible, 
comprising oligo-glycine, for example, to provide the 
diversity domains of the library with the ability to 
interact with sites in a large binding site relatively 
unconstrained by attachment to the phage protein. Rigid 
spacers, such as, e.g., oligo-proline, may also be 
inserted separately or in combination with other spacers, 
including Gly. It may be desired to have the variable 
domains close to one another and use a spacer to orient 
the variable domain with respect to each other, such as 
by employing a turn between the two sequences, as might 
be provided by a spacer of the sequence Gly-Pro-Gly, for 
example. To add stability to such a turn, it may be 
desirable or necessary to add Cys residues at either or 
both ends of each variable region. The Cys residues 
would then form disulfide bridges to hold the variable 
regions together in a loop, and in this fashion may also 
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serve to mimic a cyclic peptide. Of course, those 
skilled in th art will appreciate that various other 
types of covalent linkages for cyclization may also be 
accomplished. 

The spacer residues described above can also be 
situated on either or both ends of the variable 
nucleotide region. For instance , a cyclic peptide may be . 
accomplished without an intervening spacer, by having a 
Cys residue on both ends of the peptide. As above, 
flexible spacers, e.g., oligo-glycine, may facilitate 
interaction. of the peptide with the selected receptors. 
Alternatively, rigid spacers may allow the peptide to be 
presented as if on the end of a rigid arm, where the 
number of residues, e.g., Pro, determines not only the 
length of the arm but also the direction for the arm in 
which the peptide is oriented. Hydrophilic spacers, made 
up of charged and/ or uncharged hydrophilic amino acids, 
(e,g., Thr, His, Asn, Gin, Arg, Glu, Asp, Met, Lys, 
etc.), or hydrophobic spacers of hydrophobic amino acids 
(e.g., Phe, Leu, lie, Gly, Val, Ala, etc.) may be used to 
present the peptides to binding sites with a variety of 
local environments. 

Unless modified during or after synthesis by 
the translation machinery, recombinant peptide libraries 
consist of sequences of the 20 normal L-amino acids. 
While the available structural diversity for such a 
library is large, additional diversity can be introduced 
by a variety of means, such as chemical modifications of 
the amino acids. 

For example, as one source of added diversity a 
peptide library of the invention can have its carboxy 
terminal amidated. Carboxy terminal amidation is 
necessary to the activity of many naturally occurring 
bioactive peptides. This modification occurs In vivo 
through cleavage of the N-C bond of a carboxy terminal 
Gly residu in a two-st p reaction catalyzed by the 
enzym s p ptidylglycine alpha-amidation monooxygenase 
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(PAN) and hydroxyglycine aminotransferas (HGAT) . See . 
Eipper et al., J. Bi 1. Chem. 266:7827-7833 (1991); 
Mizuno et al., Biochem, Bioohve, Res. Comm. 137(3): 984- 
991 (1986); Murthy et al., J. Biol. Chem. 261(4): 1815- 
1822 (1986); Katopodis et al., Biochemistry 29:6115-6120 
(1990); and Young and Tamburini, J. Am. Chen. Soc. 
111:1933-1934 (1989), each of which are incorporated 
herein by reference. 

Carboxy terminal amidation can be made to a 
peptide library of the invention which has the variable 
region exposed at the carboxy terminus . Amidation can be 
performed by treatment with enzymes, such as PAH and 
HGAT, in vivo or in zifctfi/ and under conditions conducive 
to maintaining the structural integrity of the bioactive 
peptide. In a random peptide library of the present 
invention, amidation will occur on a library subset, 
i.e., those peptides having a carboxy terminal Gly. A 
library of peptides designed for amidation can be 
constructed by introducing a Gly codon at the end of the 
variable region domain of the library* After amidation, 
an enriched library serves as a particularly efficient 
source of ligands for receptors that preferentially bind 
amidated peptides. 

Many of the C-terminus amidated bioactive 
peptides are processed from larger pro-hormones, where 
the amidated peptide is flanked at its C-terminus by the 
sequence -Gly-Lys-Arg-X. . . (where X is any amino acid). 
In the present invention, oligonucleotides encoding the 
sequence -Gly-Lys-Arg-X-Stop are placed at the 3 • end of 
the variable oligonucleotide region. When expressed, the 
Gly-Lys-Arg-X is removed by in vivo or in vitro enzymatic 
treatment and the peptide library is carboxy terminal 
amidated as described above. 

Another means to add to the library diversity 
through carboxy terminal amidation involves the use of 
proteins that typically have an exposed C terminus, i.e., 
a prot in that cross s a membrane with its carboxy 
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terminus exposed on the extracellular sid of th 
membrane. In this mb dim nt the variable 
oligonucleotides region, having a stop codon in th last 
position, is inserted in the 3 1 end of a sequence which 
5 encodes C terminus exposed protein, or at least a portion 

of the protein that is responsible for the C-terminus out 
orientation. The transferrin receptor protein is an 
example of one such protein. This receptor has been 
cloned and sequenced, as reported in McClelland et al., 

10 cell 39:267-274 (1984), incorporated herein by reference. 

An internal transmembrane segment of the transferrin 
receptor serves to orient the protein with its carboxy 
terminus out. When the cDNA is expressed, typically in 
eucaryotic cells, the random peptides are located 

15 extracellularly, having their amino terminus fused to the 

transferrin receptor and with a free carboxy terminus. 

For carboxy terminal peptide libraries, a COS 

cell expression cloning system can also be used and may 

be preferred in some circumstances. COS cells are 

20 transfected with a variable nucleotide library contained 

in an expression plasraid that replicates and produces 
mRNA extrachromosomally when transfected into COS cells. 
Transfected cells bearing the random peptides are 
selected on immobilized ligand or cells which bear a 

25 binding protein, and the plasmid is isolated (rescued) 

from the selected cells. The plasmid is then amplified 
and used to trans feet COS cells for a second round of 
screening. Because the random oligonucleotides are 
inserted directly into the expression plasmid, much 

30 larger libraries (i.e., total number of novel peptides) 

are constructed. Of course, for each round of panning 
the plasmid needs to be rescued from the COS cells, 
transfected into bacteria for amplification, re-isolated 
and transfected back into COS cells. 

35 Other expression systems for carboxy terminal 

amidation of peptides of the invention can also be used. 
For example, the variable oligonucleotide s guenc s are 
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inserted into the 3' end of, e.g. , the transferrin 
recept r cDNA contained in a baculovirus transfer vector. 
Viral DNA and trans f r v ctpr ar co-trans fected into 
insect cells (e.g., Sf9 cells) which are used to 
propagate the virus in culture. When transferrin 
receptor is expressed, cells harboring recombinant virus, 
i.e.', those producing the transferrin recept or/ variable 
peptide fusion protein, are selected using an anti- 
transferrin receptor monoclonal antibody linked to a 
particle such as magnetic microspheres or other substance 
to facilitate separation. The selected cells are further 
propagated, allowed to lyse and release the library of 
recombinant extracellular budded virus into the media. 

The library of recombinant virus is amplified 
(e.g., in Sf9 cells), and aliquots of the library stored. 
Sf9 cells are then infected with the library of 
recombinant virus and panned on immobilized target 
receptor, where the panning is timed to occur with 
transferrin receptor expression. The selected cells are 
allowed to grow and lyse, and the supernatant used to 
infect new Sf9 cells, resulting in amplification of virus 
that encodes peptides binding to the target receptor. 
After several rounds of panning and amplification, single 
viruses are cloned by a Sf9 cell plague assay as 
described in Summers and Smith, A Manual of Methods for- 
Baculovirus Vectors and Insect Cell Culture Procedures . 
Texas Agricultural Experiment Station Bulletin No. 1555 
(1988) , incorporated herein by reference. DNA in the 
variable oligonucleotide insert region is then sequenced 
to determine the peptides which bind to the target 
receptor. 

An advantage of the baculovirus system for 
peptide library screening is that expression of the 
transferrin receptor/random peptide fusion protein is 
very high (>1 millions receptors per cell). A high 
expression level increases the likelihood of successful 
panning based on stoichiometry and/or contributes to 
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polyvalent int Tactions with an immobilized targ t 
r ceptor. Another advantage of the baculovirus system is 
that, similar to the peptide on phage method, infectivity 
is exploited to amplify virus which is selected by the 
panning procedure. During the series of pannings, the 
DNA does not need to be isolated and used for subsequent 
transfections of cells. 

Other expression systems can be employed in the 
present invention. As eucaryotic signal sequences are 
operable in yeast and bacteria, proteins with a carboxy 
terminus out orientation, such as the transferrin 
receptor, can be appropriately expressed and oriented in 
yeast or bacteria. The use of yeast or bacteria allows 
large libraries and avoids potential problems associated 
with amplification. 

Other modifications found in naturally 
occurring peptides and proteins can be introduced into 
the libraries to provide additional diversity and to 
contribute to a desired biological activity. For 
example, the variable region library can be provided with 
codons which code for amino acid residues involved in 
phosphorylation , glycosylation , sulfation , isoprenylation 
(or the addition of other lipids) , etc. Modifications 
not catalyzed by naturally occurring enzymes can be 
introduced by chemical means (under relatively mild 
conditions) or through the action of, e.g., catalytic 
antibodies and the like. In most cases, an efficient 
strategy for library construction involves specifying the 
enzyme (or chemical) substrate recognition site within or 
adjacent to the variable nucleotide region of the library 
so that most members of the library are modified. The 
substrate recognition site added could be simply a single 
residue (e.g., serine for phosphorylation) or a complex 
consensus sequence, as desired. 

Conformational constraints, or scaffolding, can 
also be introduced into the structure of the p ptid 
librari s. A number f motifs from known protein and 
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peptide structures can be adapted for this purpose. The 
method involves introducing nucleotid sequences that 
code for conserved structural residues into r adjacent 
to the variable nucleotide region so as to contribute to 
the desired peptide structure. Positions nonessential to 
the structure are allowed to vary. 

A degenerate peptide library as described 
herein can incorporate the conserved frameworks to 
produce and/or identify members of families of bioactive 
peptides or their binding receptor elements. Several 
families of bioactive peptides are related by a secondary 
structure that results in a conserved "framework," which 
in some cases is a pair of cysteines that flank a string 
of variable residues. This results in the display of the 
variable residues in a loop closed by a disulfide bond, 
as discussed above. 

In some cases a more complex framework is 
shared among members of a peptide family which 
contributes to the bioactivity of the peptides. An 
example of this class is the conotoxins, peptide toxins 
of 10 to 30 amino acids produced by venomous molluscs 
known as predatory cone snails. The conotoxin peptides 
generally possess a high density of disulfide cross- 
linking. Of those that are highly cross-linked, most 
belong to two groups, rou and omega, that have conserved 
primary frameworks as follows: 

mu CC....C C CC; and 

omega C C CC. . . . .C C 

The number of residues flanked by each pair of C's varies 
from 2 to 6 in the peptides reported to date. The side 
chains of the residues which flank the Cys residues are 
apparently not conserved in peptides with different 
specificity, as in peptides from different species with 
similar or identical specificities. Thus, the conotoxins 
have exploited a conserved, densely cross-linked motif as 
a framework for hypervariable regions to produce a huge 
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array f p ptides with many different pharmacological 
eff cts. 

The xnu and omega classes (with 6 C's) have 15 
possible combinations of disulfide bonds. Usually only 
one of these conformations is the active ("correct") 
form. The correct folding of the peptides may be 
directed by a conserved 40 residue peptide that is 
cleaved from the N-terminus of the conopeptide to produce 
the small, mature bioactive peptides that appear in the 
venom. 

With 2 to 6 variable residues between each pair 
of C's, there are 125 (5 3 ) possible framework arrangements 
for the mu class (2,2,2 r to 6,6,6), and 625 (5 4 ) possible 
for the omega (2,2,2,2 to 6,6,6,6). Randomizing the 
identity of the residues within each framework produces 
10 10 to >10 30 peptides. M Cono-like n peptide libraries are 
constructed having a conserved disulfide framework, 
varied numbers of residues in each hypervariable region, 
and varied identity of those residues. Thus, a sequence 
for the structural framework for use in the present 
invention comprises Cys-Cys-Y-Cys-Y-Cys-Cys, or Cys-Y- 
Cys-Y-Cys-Cys-Y-Cys-Y-Cys , wherein Y is (NNK) X or (NNS) x , 
and where N is A, C, G or T, K is G or T, S is G or C, 
and x is from 2 to 6. 

Other changes can be introduced to provide 
residues that contribute to the peptide structure, around 
which the variable amino acids are encoded by the library 
members. For example, these residues can provide for a- 
helices, a helix-turn-helix structure, four helix 
bundles, etc., as described. 

Another exemplary scaffold structure takes 
advantage of metal ion binding to conf ormationally 
constrain peptide structures. Properly spaced invariant 
metal ligands (cysteines and histidines) for certain 
divalent cations (e.g. , zinc, cobalt, nickel, cadmium, 
etc.) can be specifically incorporated into the peptide 
libraries. Depending on the particular motif specified 
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these can result (in the case of zinc coordinati n, for 
exampl ) in zinc loops, zinc fingers, zinc twists, or 
zinc clusters, as generally described in Berg ( J. Biol. 
Cheiru 265:6513-6516 (1990)), Green et al. f Proc. Natl. 
Acad, sci. USA, 86:4047-4051 (1989)), Parraga et al. 
( Science 241:1489-1492 (1988)), Began et al. f Biochem . . 
29:10878-10883 (1990)), and Vallee et al. ( Proc. Natl. 
Acad. Sci. USA . 88:999-1003 (1991)), each incorporated 
herein by reference. Other DNA binding peptides, such as 
those which correspond to the transcriptional 
transactivators referred to as leucine zippers, can also 
be used as a framework, where leucine residues are 
repeated every seven residues in the motifs, and the 
region is adjacent to an alpha helical region rich in 
lysines and arginines and characterized by a conserved 
helical face and a variable helical face. 

Other specialized forms of structural 
constraints can also be used in the present invention. 
For example, certain serine proteases are inhibited by 
small proteins of conserved structure (e.g., pancreatic 
trypsin inhibitor) . This conserved framework can 
incorporate degenerate regions as described herein to 
generate libraries for screening for novel protease 
inhibitors. 

In another aspect related to frameworks for a 
peptide library, information from the structure of known 
ligands can be used to find new peptide ligands having 
features modified from those of the known ligand. In 
this embodiment, fragments of a gene encoding a known 
ligand, prepared by, e.g., limited DNAse digestion into 
pieces of 20 to 100 base pairs, are subcloned into a 
variable nucleotide region system as described herein 
either singly or in random combinations of several 
fragments. The fragment library is then screened in 
accordance with the procedures herein for binding to the 
receptor to identify small peptides capable of binding to 
the receptor and having characteristics which differ as 
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desired from the parental peptide ligand. This is useful 
for screening for any receptor-ligand interaction where 
one or both members are encoded by a gene, e.g., growth 
factors, hormones, cytokines and the like, such as 
insulin, interleukins, insulin-like growth factor, etc. 

The peptide-phage libraries of the present 
invention can also be used to determine the site 
specificity of enzymes that modify proteins, e.g., the 
cleavage specificity of a protease. For example, factor 
X a cleaves after the sequence Zle-Glu-Gly-Arg. A library 
of variable region codons as described herein is 
constructed having the structure: signal sequence-- 
variable region — Tyr-Gly-Gly-Phe-Leu — pZZI. Phage from 
the library are then exposed to factor and then panned 
on an antibody (e.g., 3E7) , which is specific for N- 
terminally exposed Tyr-Gly-Gly-Phe-Leu. A pre-cleavage 
panning. step with 3E7 can be employed to eliminate clones 
cleaved by £. coli proteases. Only members of the 
library with random sequences compatible with cleavage 
with factor X a are isolated after panning, which sequences 
mimic the Zle-Glu-Gly-Arg site. 

Another approach to protease substrate 
identification involves placing the variable region 
between the carrier protein and a reporter sequence that 
is used to immobilize the complex (e.g., Tyr-Gly-Gly-Phe- 
Leu). Libraries are immobilized using a receptor that 
binds the reporter sequence (e.g., 3E7 antibody). Phage 
clones having sequences compatible with cleavage are 
released by treatment with the desired protease. 

Some peptides, because of their size and/or 
sequence, may cause severe defects in the infectivity of 
their carrier phage. This causes a loss of phage from 
the population during reinfection and amplification 
following each cycle of panning. To minimize problems 
associated with defective infectivity, DNA prepared from 
the eluted phage is transformed into appr priate host 
cells, such as, .g., L. coli . pr ferably by 



WO 91/19818 



PCT/US91/04384 



24 

1 ctr poration, as described in, for example, Dover t 
al., Nucl. Acids Res. 16:6127-6145 (1988), incorporated 
herein by reference, or by veil known chemical means. 
The cells are cultivated for a period of time sufficient 
for marker expression, and selection is applied as 
typically done for DNA transformation. The colonies are 
amplified, and phage harvested for affinity enrichment as 
described below. Phage identified in the affinity 
enrichment can be re-amplified in additional rounds of 
propagation by infection into appropriate hosts. 

The successful trans formants are typically 
selected by growth in a selective medium or under 
selective conditions, e.g., an appropriate antibiotic, 
which, in the case of the fd-tet vector, is preferably 
tetracycline. This may be done on solid or in liquid 
growth medium. For growth on solid medium, the cells are 
grown at a high density (~10 8 to 10 9 tfs per m 2 ) on a 
large surface of, for example, L-agar containing the 
selective antibiotic to form essentially a confluent 
lawn. The cells and extruded phage are scraped from the 
surface and phage are prepared for first round of panning 
essentially as described by Parmley and Smith, Gene 
73:305-318 (1988). For growth in liquid culture, cells 
may be grown in L-broth and antibiotic through about 10 
or more doublings. The phage are harvested by standard 
procedures (see Sambrooke et al.. Molecular Cloning . 2nd 
ed. (1989), supra . for preparation of M13 phage) as 
described below. Growth in liquid culture may be more 
convenient because of the size of the libraries, while 
growth on solid media likely provides less chance of bias 
during the amplification process. 

For affinity enrichment of desired clones, 
generally about 10 3 to 10 4 library equivalents (a library 
equivalent is one of each recombinant; 10 4 equivalents of 
a library of 10 9 members is 10 9 x 10 4 « 10 X3 phage) , but 
typically at least 10 2 library equivalents but up t about 
10 5 to 10 6 , are incubated with a receptor (or portion . 



WO 91/19818 



PCI7US91/04384 



25 

ther f) to which the desir d p ptide is s ught. The 
receptor is in ne of sev ral forms appropriate for 
affinity enrichment schemes. In n example the receptor 
is immobilized on a surface or particle , and the library 
of phage bearing peptide is then panned on the 
immobilized receptor generally according to the procedure 
described below. 

A second example of receptor presentation is 
receptor attached to a recognizable ligand (which may be 
attached via a tether) . A specific example of such a 
ligand is biotin. The receptor, so modified, is 
incubated with the library of phage and binding occurs 
with both reactants in solution. The resulting complexes 
are then bound to streptavidin (or avidin) through the 
biotin moiety. The streptavidin may be immobilized on a 
surface such as a plastic plate or on particles, in which 
case the complexes 

(phage/peptide/receptor/biotin/ streptavidin) are 
physically retained; or the streptavidin may be labelled, 
with a fluorophore, for example, to tag the active 
phage/peptide for detection and/or isolation by sorting 
procedures, e.g., on a fluorescence-activated cell 
sorter. 

Phage which express peptides without the 
desired specificity are removed by washing. The degree 
and stringency of washing required will be determined for 
each receptor/peptide of interest. A certain degree of 
control can be exerted over the binding characteristics 
of the peptides recovered by adjusting the conditions of 
the binding incubation and the subsequent washing. The 
temperature, pH, ionic strength, divalent cation 
concentration, and the volume and duration of the washing 
will select for peptides within particular ranges of 
affinity for the receptor. Selection based on slow 
dissociation rate, which is usually predictive of high 
affinity, i6 the most practical route. This may be done 
ith r by continued incubation in th presence of a 
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saturating am tint of free ligand, or by increasing the 
volume, number, and length of the washes. In each case, 
th rebinding of dissociated peptide-phage is prevented, 
and with increasing time, peptide-phage of higher and 
higher affinity are recovered. Additional modifications 
of the binding and washing procedures may be applied to 
find peptides which bind receptors under special 
conditions . 

Although the phage screening method is highly 
specific, the procedure generally does not discriminate 
between peptides of modest affinity (micromolar 
dissociation constants) and those of high affinity 
(nanomolar dissociation constants or greater) . The 
ability to select phage bearing peptides with relatively 
low affinity may be result of multivalent interaction 
between a phage/peptide particle and a receptor. For 
instance, when the receptor is an IgG antibody, each 
phage bearing peptides may bind to more than one antibody 
binding site, either by a single phage binding to both 
sites of single IgG molecule or by forming network of 
phage-IgG, which multivalent interaction produces a high 
avidity and tenacious adherence of the phage during 
washing. 

To enrich for the highest affinity peptide 
ligands, a substantially monovalent interaction between 
phage and the receptor (typically immobilized on a solid- 
phase) may thus be appropriate. The screening 
(selection) with substantially monovalent interaction can 
be repeated as part of additional rounds of amplification 
and selection of bacteriophage. Thus, under these 
circumstances the receptor molecule is substantially 
monovalent, such as the Fab binding fragment of an 
antibody molecule. 

A strategy employing a combination of 
conditions favoring multivalent or monovalent 
interactions can be used to advantage in producing new 
peptide ligands for recept r molecules. By conducting 
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the first rounds of scr ening under conditions t promote 
multivalent interactions, high stringency washing can be 
used to greatly reduce the background of non-sp cifically 
bound phage. This high avidity step nay select a large 
pool or peptides with a vide range of affinities, 
including those with relatively low affinity. It may 
select for specific recognition kernels, such as the Tyr- 
Gly dipeptide described in the examples below. 
Subsequent screening under conditions favoring 
increasingly monovalent interactions and isolation of 
phage based on a slow dissociation rate may then allow 
the identification of the highest affinity peptides. 
Monovalent interactions may be achieved by employing low 
concentrations of receptor (for example, from about 1 to 
100 pM) . 

It should be noted that, as an aspect of the 
present invention, determining a dissociation rate for a 
peptide of interest and the selected receptor molecule 
under substantially monovalent conditions allows one to 
extrapolate the binding affinity of the peptide for the 
receptor. This procedure avoids the necessity and 
inconvenience of separately determining binding 
affinities for a selected peptide, which could be 
especially burdensome if a large number of peptides have 
been selected. 

Once a peptide sequence that imparts some 
affinity and specificity for the receptor molecule is 
known, the diversity around this "recognition kernel" may 
be embellished. For instance, variable peptide regions 
may be placed on one or both ends of the identified 
sequence. The known sequence may be identified from the 
literature, as in the case of Arg-Gly-Asp and the 
integrin family of receptors, for example, as described 
in Ruoslahti and Pierschbacher, Science 238:491-497 
(1987) , or may be derived from early rounds of panning in 
the context of the present invention. 
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Libraries of peptides on phage produced and 
screened acc rding to th present invention are 
particularly useful for mapping antibody epitopes. The 
ability to sample a large number of potential epitopes as 
described herein has clear advantages over the methods 
based on chemical synthesis now in use and described in, 
among others, Geysen et al. f J. Immunol. Meth. 102:259- 
274 (1987) . In addition, these libraries are useful in 
providing new ligands for important binding molecules, 
such as hormone receptors, adhesion molecules, enzymes, 
and the like. 

Accordingly, the following examples are offered 
by way of illustration, not by way of limitation. 

EXAMPLE I 

Reagents and Strains 

BstXI restriction endonuclease, T4 DNA ligase, 
and T4 kinase were obtained from New England Biolabs. 
Streptavidin and biotinylated goat anti-mouse IgG were 
obtained from BRL. Seguenase 2.0 was obtained from U.S. 
Biochemical. Monoclonal antibody 3E7 used in initial 
studies was provided by A. Herz and is described in Meo 
et al., infra . , incorporated herein by reference, and was 
also purchased from Gramsch Laboratories (Schwabhausen, 
West Germany) . [ 125 I-tyr 28 ]b-endorphin (2000 Ci/mmol) was 
purchased from Amersham Corp. (Arlington Heights r IL) . 
Oligonucleotides were synthesized with an Applied 
BioSystems PCR-Mate and purified on OPC columns (ABI) . 
Peptides were synthesized with an Applied BioSystems 431A 
(Foster City, CA) or Biosearch model 9600 (San Rafael, 
CA) synthesizer and purified to greater than 95% purity 
by reverse phage HPLC. Peptide content of the pure 
peptides was determined by amino acid analysis and the 
composition was verified by FAB-MS. Bacteriophage fd-tet 
and E. coli K91 were provided by G. Smith, Univ. of 
Missouri, C lumbia, M0 65211, and are described in, among 



29 



others, Zacher et al., Gene 9:127-140 (1980), Smith et 
al., Science 228:1315-1317 (1985) and Parmley and Smith, 
Gene 73:305-318 (1988). 

Construct ion of Vector fAFFl 

A filamentous bacteriophage vector was 
constructed from the tetracycline resistance transducing 
vector fdTet, described in Zacher et al., The 
vector, designated fAFFl, was designed to provide many 
choices in the size and location of the peptides 
expressed fused to the pi I I bacteriophage coat protein, 
pill is made as a preprotein with an 18 amino acid leader 
sequence that directs pi 1 1 to the inner membrane of the 
bacterial host cell before it becomes assembled into an 
intact phage particle (Goldsmith and Konigsberg, Biochem. 
16:2686-2694 (1977) and Boeke and Model, Proc. Natl. 
Acad, sci. USA 79:5200-5204 (1982) incorporated herein by 
reference) . As explained further below, a peptide 
library was constructed by cloning an oligonucleotide of 
the structure shown in Fig. IB to place the variable 
hexapeptide region at the N-terminus of the processed 
protein. These first six residues are followed by two 
glycines and then the normal sequence of pill. The 
library consists of about 3 x 10* independent 
recombinants • 

A cloning site, consisting of two non- 
complementary BstXI sites, was engineered into the 5'- 
region of gene III. As shown in Fig. 1A, two non- 
complementary BstXI sites flank the region encoding amino 
acids surrounding the signal peptidase site (the K- 
terminus of the mature pill) . fAFFl also has a -1 
frameshift mutation in pill that results in non-infective 
phage. By removing the BstXI fragment and inserting an 
oligonucleotide of the appropriate structure, (a) 
portions of the removed sequence can be precisely 
reconstructed (the correct signal peptide site, for 
example,) (b) one or mor additi nal amino acids may be 
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xpr ssed at several locations, and (c) th correct 
translation frame is restor d to produce activ , 
infective pill. 

Construction of the cloning site at the 5 1 - 
region of gene III was accomplished by first removing a 
BstXI restriction site already present in the TN10 region 
of fdTet, RF DNA was digested with BstXI restriction 
endonuclease, and T4 DNA polymerase was added to remove 
the protruding 3' termini. Blunt-ended molecules were 
then li gated and transformed into MC1061 cells* RF DNA 
isolated from several tetracycline resistant 
transformants was digested again with BstXI; a clone that 
was not cleaved was selected for construction of the 
double BstXI site. Site-directed mutagenesis (Kunkel et 
al., Meth. Enzvmol. 154:367-382 (1987), incorporated by 
reference herein) was carried out with the 
oligonucleotide 5 1 -TAT GAG GTT TTG CCA GAC AAC TGG AAC 
AGT TTC AGC GGA GTG CCA GTA GAA TGG AAC AAC TAA AGG. 
Insertion of the correct mutagenic sequence was confirmed 
by dideoxy sequencing of RF DNA isolated from several 
tetracycline -resistant transformants. 

Construction of a Diverse Oligonucleotide Library 

Oligonucleotides which were cloned have the 
general structure shown in Fig. IB. The 5 9 and 3 V ends 
have a fixed sequence , chosen to reconstruct the amino 
acid sequence in the vicinity of the signal peptidase 
site. The central portion contained the variable regions 
which comprise the oligonucleotide library members, and 
may also code for spacer residues on either or both sides 
of the variable sequence. 

A collection of oligonucleotides encoding all 
possible hexapeptides was synthesized with the sequence 
5»-C TCT CAC TCC (NNK) GGC GGC ACT GTT GAA AGT TGT-3 • . 
N was A, C, G, and T (nominally equimolar) , and K was G 
and T (nominally equimolar) . This sequence, designated 
ON-49, was ligated into the BstXI sit s of fAFFl after 
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annealing to two "half-site" oligonucleotides, ON-28 (5 1 - 
GGA GTG AGA GTA GA-3 » ) and ON-29 (5'-CTT TCA ACA GT-3 1 ) , 
which ar complementary to th 5'- and 3'- porti ns of 
ON-49, respectively. "Half-site" oligonucleotides anneal 
to the 5 1 - and 3 f - ends of oligonucleotide ON-49 to form 
appropriate BstXI cohesive ends. This left the 
appropriate BstXI site exposed without the need to digest 
with BstXI, thus avoiding the cutting of any BstXI sites 
that might have appeared in the variable region. The 
vector fAFFl (100 jig) was digested to completion with 
BstXI, heat inactivated at 65 # C, and ethanol precipitated 
twice in the presence of 2 M ammonium acetate. 
Oligonucleotides were phosphorylated with T4 kinase, and 
annealed in 20 mM Tris-HCl, pH 7.5, 2 mM MgCl 2 , 50 mM 
NaCl, by mixing 1.5 pg ON-28, 1.2 ON-29, and 0.5 pg 
ON-49 with 20 Mg BstXI-digested fAFFl RF DNA, heating to 
65 • C for 5 minutes and allowing the mixture to cool 
slowly to room temperature. This represented an 
approximate molar ratio of 1:5:100:100 (fAFFl vector: ON- 
49: ON-28: ON-29). The annealed structure is then 
ligated to BstXI-cut fAFFl RF DNA to produce a double- 
stranded circular molecule with a small, single stranded 
gap. These molecules may be transformed into host cells. 
The annealed DNA was ligated in 20 mM Tris-HCl, pH 7.5, 5 . 
mM MgCl 2 , 2 mM DTT, 1 mM ATP, by the addition of 20 units 
of T4 DNA ligase and incubated overnight at 15 # C. 

Alternatively, before transformation, the gap 
may be filled-in under conditions disfavoring secondary 
structure in the variable region.. In some experiments 
the gapped circular structure created by this ligation 
was filled in with T4 DNA polymerase in the presence of 
ligase and dNTPs (400 /xM each) to produce a covalently 
closed, double-stranded molecule (Kunkel et al., supra ) . 
The ligated DNA was ethanol precipitated in the presence 
of 0.3 M sodium acetate, resuspehded in water, and 
transformed by electroporation into MC1061. Five 
el ctro-transformations, each containing 80 ^1 of cells 
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and 4 of DNA (50 /xg/al) , were perf rmed by pulsing at 
12.5 kV/cm f r 5 msec as described in Dower et al., 
Nucleic Acids Res. 16:6127-6145 (1988), inc rporated by 
reference herein. After one hour of non-selective 
outgrowth at 37 • C in 2 ml of SOC medium (2% Bacto 
tryptone, 0.5% Bacto yeast extract, 10 mM NaCl, 2.5 mM 
KC1, 10 AM MgCl 2 , 10 mM MgS0 4 , 20 mM glucose; see Hanahan* 
«7» MOl. Biol. 166:557-580 (1983)), the transformations 
were pooled, an aliquot was removed, and several 
dilutions were plated on LB agar plates containing 
tetracycline (20 Mg/ml) to assess the transformation 
efficiency. The remainder was used to inoculate- one 
liter of 3>broth containing tetracycline (20 fig/ml) and 
was grown through approximately 10 doublings at 37 'C to 
amplify the library. 

isolation Q f Pfaqe 

Phage from liquid cultures were obtained by 
clearing the supernatant twice by centrifugation (8000 
RPM for 10 min in JA10 rotor, at 4 # ), and precipitating 
phage particles with polyethylene glycol (final 
concentration 3.3% polyethylene glycol-8000, 0.4 M NaCl), 
and centrifuged as described above. Phage pellets were 
resuspended in TBS (50 mM Tris^HCl, pH 7.5, 150 mM NaCl) 
and stored at 4'C. In some cases, phage were isolated 
from plate stocks by scraping from the agar surface, 
resuspending in L-broth, and purifying as described 
above. 

Affinity Purification 

Approximately 10 3 to 10 4 library equivalents of 
phage were reacted overnight with 1 /ig of purified 
antibody in 1 ml TBS at 4 # c. (Under these conditions, 
phage and antibody are about equimolar; therefore, 
antibody is in great excess over the phage ligand 
peptides.) Phage expressing peptides with affinity for 
mAb3E7 were isolated by a modification of the procedure 
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of Parmley and Smith, supra . A 60 x 15 mm polystyrene 
petri plate was coated with 1 ml of streptavldin solution 
(1 mg/ml streptavidin in 0.1 M NaHC0 3 , pH 8.6, 0.02% NaN 3 ) 
and incubated overnight at 4 # C. The streptavidin 
solution was removed the following day. The plate was 
filled with 10 ml blocking solution (30 mg/ml BSA, 3 
Mg/ml streptavidin in 0.1 H NaHC0 3 , pH 9.2, 0.02% NaN 3 ) 
and incubated for two hours at room temperature. 
Biotinylated goat anti-mouse IgG (2 fig) was added to the 
antibody-reacted phage library and incubated for two 
hours at 4 # C. Immediately before panning, the blocking 
solution was removed from the streptavidin-coated plate, 
and the plate was washed 3 times with TBS/0.05% Tween 20. 
The antibody-reacted phage library was then added to the 
plate and incubated for 30 min. at room temperature . The 
phage solution was removed and the plate was washed ten 
times with 10 ml TBS/0.05% Tween 20 over a period of 60 
min. at room temperature. Adherent phage were removed by 
adding 800 pi of elution buffer (1 mg/ml BSA in 0.1 N HC1 
adjusted to pH 2.2 with glycine) to the petri plate and 
incubating for 10 min. to dissociate the immune 
complexes. The eluate was removed, neutralized by 
addition of 45 pi of 2 M Tris base, and used to infect 
log phase E. coli K91 cells. 

The infected cells were then plated on LB agar 
plates containing tetracycline (20 pg/ml) , and grown 
overnight at 37 Phage were isolated from these plates 
as described above and the affinity purification process 
was repeated for two more rounds. After each round of 
panning and amplification, DNA of phage from several 
thousand colonies was pooled and sequenced to estimate 
the diversity in the cloning site. In the first two 
positions of each codon, bands of about the same 
intensity appeared in each lane, indicating the expected 
distribution of bases in these positions. In the third 
position of each codon the G band was somewhat mor 
intense than the T band. 
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After the final round f panning and 
amplificati n, a portion of the eluate was used to infect 
cells that were plated at low density on LB tetracycline 
plates. To analyze the diversity of peptide sequences in 
the library in a more direct way, we picked 52 individual 
colonies producing infectious phage, and sequenced the 
ONA of their variable regions. Individual colonies were 
picked and transferred to culture tubes containing 2 ml 
LB tetracycline and grown to saturation. Phage DNA was 
isolated and then sequenced by a method designed for the 
Beckman Bioaek Workstation employing 96-well microtiter 
plates (Mardis and Roe, Biotechnioues 7:840-850 (1989), 
incorporated by reference herein) . Single stranded DNA 
was sequenced using Sequenase 2.0 and an oligonucleotide 
sequencing primer (5'-CGA TCT AAA GTT TTG TCG TCT-3 1 ) 
which is complementary to the sequence located 40 
nucleotides to the 3' side of the second BstXI site in 
fAFFl. 

The distribution of bases at each position 
within each codon is given in Table 1. The first two 
positions of each codon have close to the expected 
eguimolar distribution of the four bases. The third 
position is significantly biased, containing about 50% 
more G than T in this sample. This bias is most likely 
introduced during the chemical synthesis of the 
oligonucleotide mixture, but may also reflect biological 
biases imposed on the expressed peptides. 
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Table l: Hucl otid Distribution in th Diversity Region 
of Infectious Phage Randomly Selected from the Library. 



5 



10 



15 



Frequency 


of each base by position in codon (%) 
K N K 


G 


31 


27 


59 


A 


22 


22 


<1 


T 


25 


26 


39 


C 


22 


24 


1 


In Fig. 2, the 


amino acid 


sequences are 


listed for the 



peptides encoded by -the oligonucleotide inserts of a 
sample of randomly selected, infectious phage. The amino 
acid content of the expressed peptides from the 52 
randomly selected infectious phage appears in Table 2. 
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Tabl 2! Amin Acid Content in the Variable Peptide of 52 
Randomly-Sel cted Infectious Phage 



10 



15 



20 



25 



30 



35 1 



40 



45 



50 



Amino 

Obs/Nom 

Acid 


Nominal 
Frequency 


Nominal 
Occurrence 


Obser 
Occu 


A 
1.42 


ft nee 


19 


27 


Q 

0.89 


O* 032 


9 


8 


U 
1.11 


0.032 


9 


10 


E* 

1.00 


ft ft^ ^ 
0 « 032 


A 

9 


9 


F 

1.33 


0.032 


9 


12 


fZ 

1.74 


A A£ C 


19 


33 


TV 

n 
0.78 


ft ft ^ ^ 

0 • 032 


9 


7 

• 


0.67 


A A^ "1 

U. 032 


9 


6 


rv 
1.78 


ft ft «J *5 

O • 032 


A 

9 


16 


L 
1.25 


0.097 


28 


35 


M 
1.11 


n m ^ 

u • UJ« 


ft 

9 


10 


0.78 




A 

9 


7 


p 
0.47 


U • OOD 


19 


9 


0 
1.67 




9 


15 


1%. 

1.04 


A no7 

0 • 097 


28 


29 


S 

1.07 


0.097 


28 


30 


T 
0.74 


0.065 


19 


14 


V 
0.95 


0.065 


19 


18 


W 
1.22 


0.032 


9 


11 


Y 
0.67 


0.032 


9 


6 



55 
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As sh vn in Table 2, the ratio of the observ d 
occurrence of each amin acid t that expected on the 
basis of codon frequency ranges from about 0.5 to 2, 
consistent with a random distribution of sequences. 

Constructing a library of peptides displayed on 
the N-terminus of processed pill necessarily alters amino 
acids in the vicinity of the signal peptidase cleavage 
site. Certain changes in the corresponding region of the 
major coat protein, pVIII, have been shown to reduce 
processing efficiency, slowing or preventing the 
incorporation of pVTII to virions. If pill were 
similarly affected, the diversity of peptides contained 
in the library would be reduced. The finding that most 
amino acids appear at each position of the variable 
peptides of randomly selected phage indicates that 
processing defects do not impose severe constraints on 
the diversity of the library. 

Isolation and sequencing of phage having high avidity for 

antj-ft-enflprphin antiftpflv- 

Monoclonal antibody 3E7 binds to B-endorphin 
and, like the 5-opioid receptor, recognizes the N- 
terminal portion of the protein (Tyr-Gly-Gly-Phe) , which 
is present on most natural opioid peptides. The antibody 
also binds tightly to leu- and met-enkephalin (YGGFL, 
YGGFM) , and a variety of related opioid peptides (Meo et 
al., Proc. Natl, Acad. Sci. USA 80:4084-4088 (1983), Herz 
et al., Life Sciences 31:1721-1724 (1982), and Gramsch et 
al., J. Neurochem. 40:1220-1226 (1983). The N-terminal 
hexapeptide library was screened against 3E7 by carrying 
out three rounds of panning, elution, and amplification. 
The recoveries of phage from this process are shown in 
Table 3. In each round the proportion of phage adsorbed 
to the antibody increased by about 100-fold, and in the 
last round, over 30% of the input phage were recovered. 
These results indicated that phage were preferentially 
enriched in each panning step. 
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10 



15 



Table 3: Recovery of Phage from Panning on mAb3E7 



Rounds of 
Recovery 
Panning 
Input/Eluted 



10" 



10 



-3 



10 



-1 



Input of 
Phage 



4.0 x 10 



2.0 X 10 



11 



1.8 X 10 



10 



Eluted 
Phage 

1.9 x 10 7 

5.0 X 10 a 

5.6 x 10* 



4.8 X 



2.5 X 



3.1 X 



20 



25 



30 



35 



40 



After each round of panning, DNA representing 
several thousand eluted phage was pooled and sequenced. 
The area of the sequencing gel corresponding to the 
insertion site in gene III is shown in Fig. 3. The codon 
TCC specifying the serine that precedes the variable 
region is indicated by an arrow. After the first round 
of panning, the codon following this serine was clearly 
enriched in TAT (the single codon for tyrosine) . After 
the second round, virtually all first codons in the 
pooled DNA appeared to be TAT. The second codons are 
strongly GGK (the two codons for glycine) . After three 
rounds of panning, it appeared that phage containing 
relatively few kinds of amino acids in the first four 
positions had been selected, whereas the fifth and sixth 
positions appeared to be as diverse as those in the 
starting phage population. 

The DNA samples from 51 individual phage 
recovered from the third panning were sequenced. The 
deduced amino acid sequences of the N-terminal 
hexapeptides are shown in Pig. 4 and the amino acid 
distributions of these peptides are summarized in Table 
4. Each of the 51 panned phages analyzed had an N- 
terminal tyrosine, and nearly all (94%) had a glycine in 
th second position. The third position in our sample is 
occupied by many amino acids, some of which are present 
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more often than would b exp cted by chance. The fourth 
positi n is occupied primarily by the large aromatic 
residues Trp and Phe (together 50%) , and the bulky 
hydrophobic residues Leu and lie (an additional 45%) • 
5 The fifth and sixth positions contain essentially random 

distributions of amino acids, with only alanine appearing 
at slightly greater than chance in position five. 



10 

Table 4: Distribution of Amino Acids in the Diversity 
Peptide of 51 Phage Selected by Panning 
With Anti-endorphin Antibody 

15 " 
Residue Amino Nominal Observed 
Enrichment* ^ 

Position Acid Frequency Fr»qu«ncy Ratio 



20 



25 



Y .031 1.00 33 

G .062 0.94 16 

A,S <1 



3 G .062 0.31 5 
W .031 0.10 3 
S .093 0.21 2 
A .062 0.12 2 

30 N .031 0.06 2 

D,E,F,K, <1 
L,M,P,T,V 

4 W .031 0.31 10 
35 F .031 0.19 6 

L .093 0.35 4 

I .031 0.10 3 

A,G,M <1 

40 a. Observed frequency divided by nominal frequency « 
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EXAMPLE II 
Binding Affinities of Peptides 
for Rec ptor Monad nal Antibody 3E7 
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Th affinity f peptides for the 3E7 antibody 
has been previously determined f or th se peptides related 
to naturally-occurring opioid peptides. Meo et al M 
5 supra . As none of the peptides identified by the 

procedure described herein had been previously described, 
six of these peptides were chemically synthesized and 
their binding affinities estimated. 

The peptides were synthesized according to veil 

10 known protocols , as described in, for example , 

Merrifield, Science 232; 341-3*7 (1986), incorporated by 
reference herein. A solution radioimmunoassay was used 
to estimate the binding affinities of peptides for mAb 
3E7. Solution radioimmunoassay using [ 125 I]b-endorphin 

15 (20 , 000 cpm) and purified 3E7 antibody (0.25 Mg/*1) was 

conducted as described by Meo et al., supra . with the 
exception that the final volume was 150 pi. Antibody- 
bound and free [ 125 I]b-endorphin were separated by 
addition of activated charcoal followed by 

20 centrifugation, as described in Ghazarassian et al., Life 

Sciences 27s 75-86 (1980). Antibody-bound [ 125 I]b- 
endorphin in the supernatant was measured in a gamma 
counter. For each peptide, inhibition of [ 12S I]b- 
endorphin was determined at six different concentrations 

25 at 1/3 log unit intervals and the 50% inhibitory 

concentration (IC50) was determined by fitting the data 
to a two-parameter logistic equation using the ALLFIT 
program, as described in DeLean et al., Am. J. Phvsiol, 
235:E97-E102 (1978). 

30 The previously reported high degree of 

specificity of the 3E7 antibody for the intact N- terminal 
epitope Tyr-Gly-Gly-Phe which is common to naturally 
occurring opioid peptides. Meo et al., supra , was 
verified. Removal of Tyr or deletion of any of the amino 

35 acids of the sequence Tyr-Gly-Gly-Phe-Leu had deleterious 

effect on binding affinity (Table 5) . * 
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Shown in Table 5 are the IC50 for the six 
peptides which were identified by the phage panning 
method and chemically synthesized*. Under the conditions 
of the radioimmunoassay (30 pM [ 125 I]b -endorphin; 20% 
5 tracer bound; 18 hr. incubation), the IC50 should be very 

close to the dissociation constant (Kd) for the peptide. 
The peptides are all relatively low affinity compared to 
YGGFL, with IC50's ranging from 0.35 to 8.3 /iM. 
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Table 5: Relative affinities of peptides for 3E7 antibody 
determined by solution radioimmunoassay.* 





Peptide 


N 


IC5C 


► (MM) 


Affinity 
Relative to 

YGGFL 


20 


YGGFL 


(6) 


0.0071 


(0.0054,0.0093) 


1 




YGGF 


(3) 


0.19 


(0.093,0.38) 


0.037 




YGGL 


(3) 


3.8 


(2.1,6.6) 


0.0018 




YGFL 


(3) 


28 


(17,47) 


0.00025 




YGG 


(2) 


>1000 




<0. 0000071 


25 


GGFL 


(2) 


>1000 




<0. 0000071 




GGF 


(2) 


>1000 




<0. 0000071 




GFL 


(2) 


>1000 




<0. 0000071 




YGFWGM 


(3) 


0.35 


(0.19,0.63) 


0.020 




YGPFWS 


(3) 


1.9 


(1.3,2.8) 


0.0037 


30 


YGGFPD 


(3) 


2.3 


(1.4,3.7) 


0.0031 




YGGWAG 


(3) 


7.8 


(6.0,10) 


0.00091 




YGNWTY 


(3) 


7.8 


(4.0,15) 


0.00091 




YAGFAQ 


(3) 


8.3 


(3.8,18) 


0.00086 



35 

* » Data are geometric means and 95% confidence intervals 
(calculated from S.E.M. of log IC50) from the number (N) 
of independent determinations indicated. 



40 The data indicate that although the phage panning method 

is highly specific in that no unrelated peptides were 
selected, the procedure apparently does not discriminate 
between those of moderate (pM Kd) and high (nM Kd) 
affinity. The six peptides chosen from among the 51 

45 clones that were sequenced were only a small subset of 

those which were selected by three rounds of panning. 
Based on their structural diversity, the phage library 
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should contain thousands of different peptides with 
dissociation constants that are /xM or lower. 

Th panning procedure we have utilized employs 
extensive washing to remove non-spec if ically bound phage. 
Binding experiments with mAb 3E7 and [*H] YGGFL indicate a 
rapid dissociation rate, approximately t^-45 seconds at 
room temperature. Therefore, the ability to select phage 
bearing peptides with relatively low affinities may be 
the result of multivalent interaction between phage and 
antibody, as each phage typically has up to 4 or 5 copies 
of the pill protein and each protein may carry a foreign 
peptide from the phage library. 



EXAMPLE III 

Selective Enrichment and Ch aracterization of High Affinity 
Liaands from Collections of Random Peptides on Phage 

Phage bearing peptides YGGFL and YAGFAQ served 
as models to determine the effect of IgG and Fab 
concentration on the binding and recovery of phage 
bearing high (nM Kd) and low (/iM Kd) affinity peptides. 

To determine the effect of polyvalency, a phage 
sandwich ELISA was developed which used polyclonal anti- 
phage antibodies to detect bound phage. Purified 
monoclonal antibody 3E7 was used as intact IgG and as Fab 
fragments (produced using a commercially available kit 
(Pierce), and biotinylated Fab fragments (Gramsch 
Laboratories) . No IgG was detected in Fab preparations 
when they were run on SDS-PAGE gels and stained with 
Coomassie blue. Fab was iodinated by reacting 5/ig of Fab 
in 20 jil of 0.1M borate buffer (pH 8.5) with 250 /iCi of 
[ 125 I]Bolton-Hunter reagent (Amersham) for 3 hours and 
then purified by gel filtration on S.ephadex G25. After 
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purification, the specific activity of th ["^FAB was 
approximately 15 /iCi//xg. 

Antisera were raised against phage particles 
lacking pill fAFFl, which, as described above, contains a 
frameshift in the 5' end of gene III and is produced as 
non-infective polyphage. Cells from a two liter culture 
of coli K91 were removed by centrifugation and media 
was mixed with 400 ml of 20% PEG in 0.5M NaCl. After 
incubation for 1 hr at 4 C. precipitated phage were 
isolated by centrifugation at 8500 rpm. The pellet was 
resuspended in TBS (50 mM Tris-HCl, 150 mM NaCl, pH 7.4) 
and then ultracentrifuged in a SW50 rotor at a 42,000 rpm 
for 3 hrs. The resulting pellet was resuspended in water 
and the concentration of phage was estimated according to 
the method of Day, J. Mol. Biol. 39:265 (1969). 

Three rabbits were injected intramuscularly 
with 0.5 mg of phage in Freunds complete adjuvant and 
then boosted with 0.25 mg of phage in incomplete adjuvant 
at 3 week intervals. The titer of the sera was measured 
with an ELISA using phage immobilized in Immulon 2 
tiicrotiter wells as described above. All rabbits 
produced high titer sera after the second boost. Sera 
collected after the third boost from one of the rabbits 
was used for the assays. 

Antibodies reacting with phage were affinity 
purified as follows. Phage expressing native pill (Fd- 
tet) from a two liter culture were isolated (described 
above) and added to 20 ml of sera that was diluted 4-fold 
with PBS (10 mM sodium phosphate, 150 mM NaCl, pH 7.4). 
After incubation for 2 hr at room temperature, 
phage/antibody complex was isolated by centrifugation for 
1 hr at 120,000 x g. The pellet was washed with 10 ml of 
PBS and centrifuged again. The final pellet was 
resuspended in 10 ml of 100 mM sodium acetate buffer pH 
2.5 and incubated for 10 min. at room temp. The sample 
was subjected t same centrifugation and the resulting 
supernatant was neutralized with NaOH. igG was then 
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isolated using Protein-A agarose (Pierce) according to 
the manuf acturer 1 s instructions. Th IgG was conjugated 
to alkaline ph sphatase using a commercially availabl 
kit (Pierce). 

The phage sandwich ELISAs were performed as 
follows* Microtiter wells were incubated with 100 pg/ml 
of streptavidin for 1 hr at 37 *C, then blocked with 200 
Ml of PBS /0.1% bovine serum albumin (BSA) for 1 hr* 
Biotinylated IgG (0.5 Mg/ml) or biotinylated Fab (5 
Mg/ml) (100 ml in PBS/0.1% BSA) was added to the wells 
and incubated for an additional 1 hr at room temperature. 
Preliminary studies in which immobilized IgG or Fab was 
detected with goat anti-mouse IgG conjugated to alkaline 
phosphatase indicated that these conditions maximally 
saturated the well with IgG or Fab. The difference in 
concentration for biotinylated IgG and Fab required for 
saturation of the streptavidin probably reflected 
differences in the fraction of the protein that was 
biotinylated. 

After washing the wells with PBS, 50 Ml of 
PBS/0.1% BSA/0.05% Tween 20 or the same buffer containing 
200 pM YGGFL free peptide was added to the well and 
incubated for 20 min at room temperature. Phage (10 10 
infectious particles in 50 Ml PBS/0.1% BSA/0.05 Tween 20) 
bearing the peptides YGGFL or YAGFAQ were added and 
incubated for 18 hr at 4 C. After washing with TBS/0.05% 
Tween 20, alkaline phosphatase anti-phage antibody (100 
Ml of 1:100 dilution) was added and incubated for 1 hr. 
at room temperature. After washing with TBS/0.05% Tween 
20, 100 fil of alkaline phosphatase substrate (SIGMA) in 
diethanolamine buffer (pH 9.5) was added and the 
absorbance at 405 nm was measured 10 min. later. 

Shown in Fig. 5 are the results of this assay 
when either biotinylated IgG or Fab was immobilized at 
maximal density on streptavidin coated wells. Specific 
binding was detected for both YGGFL- and YAGFAQ-phage ; 
the data indicate that th amount of binding did not 
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differ when IgG and Fab were used. In combination with 
th data on monovalent dissociation rat s of these 
peptides (see below) , this suggests that antibody binding 
sites for both IgG and Fab are in sufficient proximity to 
one another to allow simultaneous binding of more than 
one of the peptides expressed by each phage particle. 

The phage sandwich assay can also be used to 
determine the specificity and competitive nature of the 
interaction of peptide-bearing phage with immobilized 
antibody. In practice , an important aspect of the use of 
peptide on phage libraries is the characterization of 
individual phage isolates after sequential rounds of the 
affinity purification. Isolated phage may bind to other 
components found on an immobilizing surface, or may bind 
to the protein target at sites other them the active 
site. Using the phage sandwich assay, the binding of 
YGGFL- and YAGFAQ-phage was shown to be specific for the 
antibody and the interaction of the phage with antibody 
could be blocked by free YGGFL peptide. 

Shown in Fig. 6 are the results of tests on the 
effect of Fab concentration and wash time on the 
recoveries of YGGFL- and YAGFAQ-phage. Microtiter wells 
were coated with streptavidin as described above. 10 11 
infectious phage particles bearing the peptides YGGFL or 
YAGFAQ were incubated overnight at 4*C with either 50 Ml 
of 5riM or 50 pM biotinylated Fab. Aliquots were then 
added to different microtiter wells and incubated for 1 
hr. at room temperature. All the wells were washed 
quickly with TBS/0.05% Tween 20, with the last 200 /il 
wash being left in the well. At various times 
thereafter, wells washed quickly with TBS/0.05% Tween and 
the phage remaining bound were eluted with 0.1 M HC1 (pH 
adjusted to 2.2 with glycine) and guantitated by titering 
as described above. 

The results indicate that low Fab concentration 
(50pM) and dissociation times greater than 30 minutes 
allowed the selective recovery of phage bearing the 
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higher affinity peptide YGGFL. The use of a high 
concentration f Fab (5 nM) did not allow th 
discrimination of phage bearing high and low affinity 
peptides. 

Phage enrichment using a low concentration of 

frirtinvlytefl 3E7 Fab. 

A pool of phage previously isolated by three 
rounds of panning and amplification using 5 nM 3E7 IgG 
served as starting material for additional affinity 
purification and amplification using a modification of 
the previous protocol. Phage (10 11 infectious particles 
in 1 ml of TBS) were incubated overnight at 4*C with 2 ng 
of biotinylated Fab (50 pM final concentration). The 
mixture was then exposed to streptavidin-coated plates 
and bound phage were isolated as described above. 
Individual phage clones were then isolated and DNA was 
sequenced as described in Example I above. 

Shown in Table 6 are the sequences of peptide 
inserts of phage that were isolated by 2 rounds of 
affinity isolation and amplification using 50 pM of 
biotinylated Fab. A notable difference between the 
sequences shown in Table 5 and those identified in 
Example II using three rounds of affinity purification 
with 50 nM IgG, is the frequency of Phe in the fourth • 
position: 13/19 vs. 10/5i (p<0.05 Fisher Exact Test). 
Thus, the sequences more highly resemble the known high 
affinity peptides YGGFL and YGGFM. The sequence YGGFLT 
was isolated by this procedure and, of the 20 clones that 
were selected from the final pool of phage, there was 
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only one repeat (th nucleotide sequence was also 
identical) . 

TABLE 6 

n<« aoB iation tl/2 : Equilibrium. IP?0 

(minutes) (nM) 

Prmfcral Peptides 



YGGFL * 18.2 6.6 ± 3.5 

YGFWGM* 0.25 350 

YAGFAQ* ** 8300 
Peptides i solated ySt-.ti SQnM Fab 

YGAFQG 18-9 27 ± 2.0 

YGGFLT 17 . 9 

YGYWSL 15.6 

YGAFMQ 13.7 13 ± 4 . 9 

YGAFFQ 13 . 4 

YGAFFK 9.1 59 ± 22 

YGFWSN 7 . 4 

YGAFGG 5.0 

YGGFGF 4.7 65 ± 18 

YGVFSR 2.8 

YGGLSM 0.96 

YGTFLN 0.75 470 ± 140 

YGGLVR 0.50 

YGSFSL 0.43 

YGAWYT ** 1600 ± 300 

YGRFFH ** 

YGGLRH ** 

YGSFMA ** 

YGGFSP ** 



** indicates that initial binding was not detected 

rw^ATnH nation of the dissociation of T^IIFab frOTO XUSjpn 
phage clones . 

An assay employing the anti-phage antisera was 
developed t determine the rate of dissociation of 
[ 125 I]Fab from individual phage isolat s. Individual. 
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fusion phase is lates were amplified in a 5 ml liquid 
culture of £. coli K91 cells and phage particles were 
isolated and guantitated as described above. In 1 ml 
microtiter mini tubes, 25 /il of TBS containing 
approximately 2 x 10 9 infectious phage particles were 
incubated with 25 ml of TBS/0.1% BSA containing 40,000 
cpm of [^IjFab for 10 minutes. Anti-phage antisera (25 
Hi of 1:1000 dilution in PBS/0. 1%BSA) and staph A 
particles coated with goat anti-mouse IgG (25 Ml of 
Tachisorb diluted eight-fold in TBS/0.1% BSA) were then 
added. Dissociation was initiated after 2 hours further 
incubation at 4'C. To prevent binding of unbound 
[^IJFab, 25 /il of a 400 /xM solution of YGGFL in PBS/0,1% 
BSA was added to all tubes and the amount of phage-bound 
[ 125 I]Fab was determined by automated filtration on glass 
fiber filters previously treated with 1% BSA. Filter 
bound radioactivity was determined by gamma counting. 
Binding was determined in triplicate prior to and 0.5, 1, 
2, 4, 8, 16, 32 minutes after the addition of YGGFL 
peptide. The time corresponding to a 50% reduction of 
initially bound [ 125 I]Fab was determined by linear 
regression of a semi-logarithmic plot of amount bound vs. 
time. This assay was calibrated with library phage 
clones bearing peptides of known affinity (YGGFL, 7nM and 
YGFWGM, 350 nM) , as shown in Fig. 7. 

Shown in Table 6 are the tl/2 values for the 
phage clones that were picked from the pool of phage 
isolated using 5 pM Fab. Several of the clones had tl/2 
values similar to the control phage YGGFL. Specific 
binding of [ 125 I]Fab was not detected for 5 of the 20 
clones that were examined. 

For a series of related ligands, the rank order 
of dissociation rates should correlate with the rank 
order of equilibrium binding constants (Kds) . This 
correlation was confirmed and a quantitative relationship 
established between dissociation rates and the Kd of the 
corr sponding free peptide. In addition, the affinity 
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requirements for sel ction using low Fab cone ntrations 
ver stablished . 

Sev ral peptides corresponding to those phage 
clones with differing dissociation rates were chemically 
synthesized and their potencies were determined in a 
solution competition assay. The tl/2 values correlated 
with the IC50 of the corresponding free peptide. Under 
the conditions of the competition assay (low 
concentration of tracer, <20% bound tracer , 18 hr. 
incubation) , the IC50 should approximate the Kd. For 
phage bearing peptides with Kds greater than 500 nM, 
specific binding was not detected under these monovalent 
assay conditions. 



EXAMPLE IV 
Conotoxin Peptide Libraries Having 
conserved Disulfide Frameworks, 

A conotoxin peptide library is prepared as 
generally described above, by synthesizing 
oligonucleotides containing degenerate codons of the NNK 
(or NNS) motif. Here N is eguimolar A,C,G, or T, and K 
is eguimolar G or T (S=G or C) . This motif codes for all . 
20 amino acids at each locus in the hypervariable 
regions. (Alternatively, the degenerate portion can be 
assembled by the condensation of 20 activated 
trinucleotides, one for each amino acid.) The six 
cysteine codons are preserved to .produce the 
characteristic conotoxin frameworks. 

To sample additional diversity in the peptide 
libraries, the number of residues between the Cys's is 
varied. This is accomplished as follows: 

(1) Five separate oligonucleotide synthesis 
columns are prepared with the first nucleotide 
immobilized on resin. (2) The common regions of the 3 1 
end of the oligonucl otides is synthesized (all columns 
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g through the same cycles to produce the cloning site, 

tc. , on this end) • Synthesis on all columns is carried 
out through the first Cys (or CysCys) of the cono- 
framework. (3) On column l, two degenerate codons are 
5 synthesized; on column 2, three degenerate codons are 

synthesized, on so on. Each column now has 
oligonucleotides with either 2,3,4,5, or 6 degenerate 
codons in the first hypervariable region. (4) One cys 
codon is now added to all columns (this is the second Cys 

10 of the omega class or the third Cys of the mu class) . 

(5) The resins from all five columns are removed, mixed 
well, and reallocated among the five columns. Each 
column now contains oligonucleotides with all five 
lengths of first hypervariable region. (6) Each column 

15 is again put through either 2,3,4,5, or 6 cycles of 

degenerate codon synthesis as before; and the next Cys 
codon (or CysCys for omega) is added. (7) The resins 
are again removed, mixed, and redistributed to the five 
columns, and the process is repeated through three (for 

20 mu) of four (for omega) hypervariable regions. (8) The 

common sequence on the 5' end of all the oligonucleotides 
is synthesized, and the oligonucleotides are removed from 
the resins and purified as usual. 

Folding of the peptides to achieve 

25 biological activity may be directed by a 40 amino acid 

conserved "leader peptide" at the N-terminus of the 
pretoxin molecule. Synthesized as part of a recombinant 
fusion protein, this leader may enhance the folding of 
many of the members of the library into the "correct" 

30 conotoxin-like framework. Alternatively, allowing the 

cysteine framework to form in a random manner produces a 
variety of structures, only some of which mimic the 
conotoxin framework. This collection provides additional 
multi-loop structures that add to the diversity of the 

35 peptide library. 

To minimize the possibility that on 
conformation would pred minate, a gentle reduction of the 
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phag in vitro is employ d, follow d by mild oxidation to 
form most f th conformations. Mild reduction/ oxidation 
can be accomplished by treatment with 0.2 to 5 mM DTT 
followed by extensive dialysis to non-reducing 
conditions. A regenerable, immobilized lipoic acid 
column to rapidly pass the peptide-bearing particles over 
can also be used. 

The possibility of promiscuous binding of Cys 
residues in the peptide binding to other proteins can 
also be minimized by mild reduction and oxidation, or can 
be avoided by re-engineering the fusion protein by site- 
directed mutagenesis to remove the Cys residues. 

Peptides with the conotoxin framework can be 
expressed in several types of libraries as described 
herein. For example, the peptides can be 1) expressed in 
an N-terminal library in phage fAFFl; 2) expressed 
internally, fused to pill at or near the N-terminus, 
displacing the degenerate peptides 2 to 10 or more 
residues from the cleavage point to circumvent processing 
problems; 3) expressed in a carboxy terminal exposed 
library (as many of the conotoxins are C- terminally 
amidated, residues with amino side chains can be added to 
the C-terminal end of the peptides, or the peptide 
library, can be amidated in vitro ) ; and 4) the putative 
40 residue "folding peptide" can be installed upstream of 
degenerate peptides displayed in the C-terminally exposed 
configuration . 

This general format for using the secondary 
framework structure of conotoxins can also be applied to 
other peptide families with biological activities as a 
basis for designing and constructing peptide 
expression/screening libraries in accordance with the 
present invention. 

Although the foregoing invention has been 
described in some detail by way of illustration and 
example for purposes of clarity of understanding, it will 
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be appar nt that certain changes and modifications may be 
practiced within th scope of the appended claims. 
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WHAT TS CLAIMED IS: 

1. A method for identifying peptides of interest 
which bind to a preselected receptor, comprising: 

transforming host cells with a bacteriophage 
expression vector which comprises an oligonucleotide 
library of at least about 10 6 members which encode 
peptides, wherein a library member is joined in reading 
frame to the 5 1 region of a nucleotide sequence encoding 
an outer structural protein of the bacteriophage; 

cultivating the transformed host cells under 
conditions suitable for expression and assembly of 
bacteriophage; 

contacting bacteriophage that express the peptides 
to the preselected receptor under conditions conducive to 
specific peptide-receptor binding; and 

selecting bacteriophage which bind to the receptor 
and therefrom identifying the peptides of interest. 

2. The method of claim 1, further comprising the 
step of determining the nucleotide sequence encoding the 
peptide of interest in the selected bacteriophage. 

3. The method of claim 1, wherein the selected 
bacteriophage are propagated and the contacting and 
selecting steps are repeated to enrich for bacteriophage 
which express the peptides of interest. 

4. The method of claim 3, wherein the valency of 
the specific peptide-receptor binding interaction is 
reduced in subsequent repetitions of the contacting, 
selecting and propagating steps to enrich for peptides of 
higher binding affinity. 

5. The method of claim 1, wherein the bacteriophage 
expressing peptides and rec ptor are present at 
concentrati ns that produce a substantially monovalent 
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binding int raction between the receptor and 
bacteriophage which express peptides of interest. 

6. The method of claim 5, wherein the contacting 
step under substantially monovalent binding conditions is 
repeated at least once between additional rounds of 
propagation of the selected bacteriophage. 

7. The method of claim 6, wherein the receptor is 
monovalent. 

8. The method of claim 7, wherein the monovalent 
receptor is a Fab fragment of an antibody. 

9. The method of claim 1, further comprising 
determining a dissociation rate for the peptide of 
interest and the receptor under substantially monovalent 
peptide-receptor binding conditions and therefrom 
determining the binding affinity of the peptide of 
interest for the receptor. 

10. The method of claim 1, where the receptor is 
bound to a solid phase and the selected bacteriophage are 
separated from the culture. 

11. The method of claim 10 f wherein said receptor 
is an antibody or binding fragment thereof. 

12. The method according to claim 1, wherein the 
outer protein is a bacteriophage coat protein. 

13. The method of claim 1, wherein the 
bacteriophage encoded by the expression vector is a 
filamentous phage. 

14. The method of claim 13, wherein the filamentous 
bacteriophage is fl, fd, or M13. 
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10 



15. Th roethod of claim 14 f wherein the 
bacteriophage is fd or a derivative ther of. 

16. The method of claim 15, wherein the outer 
bacteriophage protein is a coat protein. 

17. The method of claim 16, wherein the coat 
protein of the fd bacteriophage is pill. 

18. The method of claim 1, wherein the 
oligonucleotide library comprises a series of codons 
encoding a random collection of amino acids. 

15 19. The method of claim 18, wherein the codons 

encoding the collection of amino acids are represented by 
(NNK) x or (NNS) a , where N is A, C, G or T, K is G or T, S 
is G or C, and x is from 5 to 8. 

20 20. The method of claim 19, wherein the series of 

codons encoding the random collection of amino acids of 
the oligonucleotide library member encodes a hexa- 
peptide . 

25 21. The method of claim 19, wherein x is 8 and the 

recombinant bacteriophage screened in the selecting step 
represents up to about 10% of the possible octapeptides. 



30 



35 



22. The method of claim 18, wherein the 
oligonucleotide library member further encodes at least 
one spacer residue. 

23. The method of claim 22, wherein a spacer 
residue comprises Gly. 

24. The method of claim 23, wherein the spacer 
comprises Gly-Gly. 
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25. Th method f claim 18, wherein the 
oligonucleotid library is flank d by nucleotide 
sequences that encode conserved residues that comprise 
structural frameworks for peptides of interest. 

26. The method of claim 25, wherein the flanking 
sequences encode Cys residues. 

27. The method of claim 26 , wherein the Cys 
residues flank the N- and C~ terminals of the peptide of 
interest. 

28. The method of claim 26, wherein at least one 
conserved Cys residue is encoded within the library 
variable region* 

29. The method of claim 25, wherein the structural 
framework comprises that of a conotox in-like peptide and 
the conserved residues are Cys. 

30. The method of claim 29, wherein the sequence of 
the structural framework comprises Cys-Cys-Y-Cys-Y-Cys- 
Cys or Cys-Y-Cys-Y-Cys-Cys-Y-Cys-Y-Cys , wherein Y is . 
(NNK) x or (NNS) x , where N is A, C, G or T, K is 6 or T, S 
is G or C, and x is from 2 to 6. 

31. The method of claim 18, wherein the variable 
codon region is prepared from a condensation of activated 
trinucleotides • 

32. The method of claim 18, wherein fragments of 20 
to 100 basepairs of a gene which encodes a known ligand 
for the preselected receptor are cloned into the 
oligonucleotide library. 



57 



33. Th method of claim 1, wherein the h st cells 
are transform d by electroporation. 

34. The method of claim 1, wherein the 
oligonucleotide library comprises at least about 10 8 
members. 

35. The method of claim 1, wherein the 
oligonucleotide library members are inserted in the 
bacteriophage expression vector so that the N-terminus of 
the processed bacteriophage outer protein is the first 
residue of the peptide. 

36. The method of claim 1, wherein the 
bacteriophage protein is a preprotein which is processed 
by the host cell to leave the peptide encoded by an 
oligonucleotide library member exposed at the N-terminus 
of the mature outer structural protein. 

37. The method according to claim 36 r wherein the 
peptide comprises spacer amino acid residues are encoded 
by the oligonucleotide library members between the N- 
terminus of the mature outer protein and the c-terminus 
of the peptide. 

38. A method for identifying peptides of interest 
which bind to a preselected receptor, comprising: 

transforming host cells with a bacteriophage 
expression vector which comprises an oligonucleotide 
library which encodes peptides, wherein a library member 
is joined in reading frame with a nucleotide sequence to 
encode a fusion protein, wherein the library member 
represents the 5 % member of the fusion protein and the 3 1 
member comprises at least a portion of an outer 
structural protein of the bacteriophage; 

cultivating the transform d cell under conditions 
suitable for expression and assembly of bacteriophage; 
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contacting bacteriophage that express the peptides 
to the pres 1 cted receptor under conditions conducive to 
specific peptide-receptor binding; and 

selecting bacteriophage which bind to the receptor 
and therefrom identifying the peptides of interest. 

39* A composition comprising a peptide produced 
according to the method of claim 1, 13, 19, 31 or 38. 

40. The composition of claim 39, wherein the 
peptide binds an antibody. 

41. An oligonucleotide library produced according 
to claims 1, 13, 19 , 31 or 38. 

42. A host cell transformed with a bacteriophage 
expression vector which comprises an oligonucleotide 
library member, joined in reading frame to the 5» region 
of a nucleotide sequence encoding an outer structural 
protein of the bacteriophage, wherein the library member 
encodes a peptide of at least about five to twenty-five 
amino acids. 

43. The host cell of claim 42, wherein the 
oligonucleotide library member comprises a series of 
codons encoding a random collection of from five to eight 
amino acids. 

44. The host cell of claim 43, wherein the 
oligonucleotide library member further comprises a 
sequence encoding at least about one to five spacer amino 
acids which are expressed adjacent to the random 
collection of amino acids. 

45. A collection of filamentous bacteriophage 
having a peptide on the N-terminus of a coat pr tein, 
wherein th peptide is coded for by a oligonucleotide. 
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library member from a randomly generated mixture of 
oligonucleotides . 
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